Extreme Value Statistics of Hierarchically Correlated Variables: Deviation from 
Gumbel Statistics and Anomalous Persistence 
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We study analytically the distribution of the minimum of a set of hierarchically correlated ran- 
dom variables Ei, E%, . . ., En where Ei represents the energy of the i-th path of a directed polymer 
on a Cayley tree. If the variables were uncorrelated, the minimum energy would have an asymp- 
totic Gumbel distribution. We show that due to the hierarchical correlations, the forward tail of 
the distribution of the minimum energy becomes highly non universal, depends explicitly on the 
distribution of the bond energies e and is generically different from the super-exponential forward 
tail of the Gumbel distribution. The consequence of these results to the persistence of hierarchi- 
cally correlated random variables is discussed and the persistence is also shown to be generically 
anomalous. 



PACS numbers: 02.50.-r, 05.40.-a 

The extreme value statistics of random variables is im- 
portant in various branches of physics, statistics, and 
mathematics 0-0] . For example, in the context of disor- 
dered systems, the thermodynamics at low temperatures 
is governed by the statistics of the low energy states. 
The statistics of extremal quantities also play impor- 
tant roles in binary search problems in computer science 
H . The extreme- value statistics is well understood when 
the random variables are independent and identically dis- 
tributed. In this case, depending on the distribution of 
the random variable, three different universality classes 
of extreme value statistics are known J3| . Recently there 
has been an attempt to identify these different universal- 
ity classes with the different schemes of replica symmetry 
breaking j^]. A natural question is: what are the univer- 
sality classes when the random variables are correlated? 
This question has recently been addressed ||D and it has 
been conjectured that this class of problems corresponds 
to the full replica symmetry breaking S . To answer this 
important question, it would thus be useful to derive ex- 
act results for the extreme value statistics of correlated 
variables, whenever possible. 

More precisely, let us consider a set of N random vari- 
ables Ex, E 2 , • • ., -Ejv drawn from a joint probability dis- 
tribution p(E±, E2, ■ ■ ■ , En). Then the minimum value, 
E m i n = xmn{Ei, E 2 , . . . , En} is also a random variable 
and one would like to know its probability distribution. 
Let, Pn{x) — Prob\E m i n > x] be the cumulative distri- 
bution of the minimum. Then clearly, 



Pn{x) 



poo poo N_ 

/ .../ p(E 1 ,E 2 ,...,E N )]]_dE l , (1) 
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since if the minimum is bigger than x, then each of 
the variables must also be bigger than x. When the 
variables are uncorrelated and each is drawn from the 
same distribution p(E), the joint distribution factorizes, 
p(E u E 2 , . . . , E N ) = p{Ei) . ..p(E N ) and from Eq. (§) 



one simply gets, Pn( x ) — [J^° p(E)dE] N . If the dis- 
tribution p(E) is unbounded and decays faster than a 
power law for large \E\, then one can show that for 
large N, Pn(x) approaches a scaling form S, Pn(x) = 
F[(x + aj\r)/6jv]- Here ajv and &/v are functions of N 
and depend explicitly on the distribution p(E), but the 
scaling function F(y) is independent of p(E) and N 
and has the universal super-exponential form, F(y) = 
exp[— exp(y)]. As a consequence, the distribution of 
the minimum P m i n (y) — —dFjdy — exp[y — exp(y)] has 
the universal Gumbel form. There are two other known 
universality classes when the distribution p{E) is either 
bounded or has algebraic tails for large \E\, but we will 
not be concerned with these cases in this paper. 

The question we focus on here is whether the Gum- 
bel law continues to hold if the random variables are 
unbounded but correlated. This question has recently 
been addressed by Carpentier and Le Doussal || who 
developed a renormalization group (RG) approach for 
logarithmically correlated variables. With logarithmic 
correlations they found that the cumulative distribu- 
tion function F(y) behaves (up to some rescaling fac- 
tors) as, F(y) = 1 — yexp(y) in the backward tail region 
y —>■ — 00. A pure Gumbel law would have predicted, 
F(y) — 1 — exp(y) as y — * —00. Thus the Gumbel law 
is indeed violated in this backward tail region. However, 
their RG approach can not predict whether the super- 
exponential forward tail of the Gumbel distribution still 
holds or not. The question we are interested in is whether 
strong correlations can also modify the super-exponential 
forward tail of the Gumbel distribution. If so, this has in- 
teresting consequence for the persistence of random vari- 
ables as we discuss below. 

The persistence of random variables, a subject that 
has generated a lot of recent interest j7j, is related to 
the distribution of the minimum in a simple way. For 
random variables each with zero mean, the persistence 
is simply the probability that all of them are positive 
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and is given by P/v(0) in Eq. (|j). For independent vari- 
ables, it follows trivially from Eq. ([!]) that P/v(0) de- 
cays exponentially with N, Pat(O) = exp(— ON) where 
6 = — log[/ p(E)dE]. For correlated variables, this 
problem has been studied for many decades by applied 
mathematicians who call it the 'one sided barrier' prob- 
lem |^,[|. It is well known that Pjv(O) is hard to compute 
analytically even for Gaussian correlated variables, i.e., 
when the joint distribution p(E±, E2, ■ ■ . , En) is a mul- 
tivariate Gaussian distribution || . If the Gaussian 
variables are arranged on a line and if the correlation be- 
tween two variables -E 1 , and Ej decays faster than 
then Pn(0) is known to decay as Pv(0) ~ exp(-ON) for 
large N [|9| , where the persistence exponent is nontrivial 
and is known exactly only in very few special cases § . It 
would thus be interesting to know if strong correlations 
can modify this exponential decay of the persistence for 
large N. 

In this paper, we show that the two issues, (a) the pos- 
sibility of a non-Gumbel forward tail of the distribution of 
the minimum and (b) the possibility of non-exponential 
decay of persistence, are related to each other for random 
variables that are hierarchically correlated. The hierar- 
chical nature of the correlation allows us to derive exact 
asymptotic results for both the quantities. Our main re- 
sults are twofold: (i) For the distribution of minimum 
value, we show that the super-exponential forward tail 
of the Gumbel law is violated under generic conditions 
and (ii) as a consequence, the persistence is anomalous, 
i.e., Pjv(O) does not decay exponentially under the same 
generic conditions. 

We consider, as a model, the well studied problem of 
a directed polymer on a tree. This problem was first 
studied by Derrida and Spohn pT| ] , who were mostly in- 
terested in the finite temperature phase transition in this 
model. Here we focus explicitly on the zero temperature 
properties. We consider a tree rooted at O (see Fig. 1) 
and a random energy ej is associated with every bond 
of the tree. The variables Cj's are independent and each 
drawn from the same distribution p(e). A directed poly- 
mer of size n goes down from the root O to any of the 
2" nodes at the level n. Thus, there are N = 2 n possible 
paths for the polymer of size n and the energy of any of 
these paths is given by, 



E, 
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The set of N — 2™ variables E\, E 2 , ■ . ., En are clearly 
correlated in a hierarchical (i.e. ultrametric) way and 
the two point correlation between the energies of any 
two paths is proportional to the number of bonds they 
share. We would then like to know the distribution of 
the minimum energy. 




FIG. 1. Each bond of a Cayley tree has an energy u- 

Clearly, Pn(x) = -Profe[P min > x] is also the probabil- 
ity that all the N paths up to the n-th level have ener- 
gies > x. Since N = 2™, let us write, for convenience, 
R n {x) = Pn{x). It is easy to see that R n {x) satisfies the 
recursion relation, 



Rn+l(x) 



dep(e)R n (x - e) 



(3) 



with the initial condition, Ro(x) = 9{—x) where 0(x) is 
the usual Heaviside step function. This relation is de- 
rived by considering various possibilities for the energies 
of the two bonds emerging from the root O and taking 
into account that the two subsequent daughter trees are 
statistically independent. The Eq. @ was studied in de- 
tail in Ref. for several distributions p(e)'s with non 
negative support. In particular, for the bivariate distri- 
bution, p(e) = pS(e — 1) + (1 — p)5(e), the solution of Eq. 
(^) was shown to undergo a depinning phase transition 
at p c = 1/2 p2| . Since in this paper we are mostly in- 
terested in the persistence of the E% variables, we restrict 
ourselves subsequently only to symmetric distributions 
p(e) with zero mean. Defining R n (x) = Q^(x), Eq. (||) 
can be recast into, 



i(x) 



dep{e)Q 2 n (x - e), 



(4) 



9(— x) and the bound- 
00 and Q n {x) — > 1 as 



with the initial condition, Qo(x) 
ary conditions, Q n (x) — > as x 
x — > -co. 

The Eq. (Q) is known jl^] to admit a traveling front 
solution, Q n {x) = q(x + vn) where the front propagates 
in the negative x direction with a constant velocity v as 
n increases (see Fig. 2). Substituting Q n {x) — q(x + vn) 
in Eq. (H), we get 



dep(e)q 2 {y - v - e), 



(5) 
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with the boundary conditions, q(y) — * 1 as y — > — oo and 
q(y) — > as y — > oo, with the front located around y = 0. 
The velocity v can then be determined exactly by ana- 
lyzing the backward tail region, y — > — oo of the function 
q(y). In this regime, substituting q(y) = 1 — g(y) in Eq. 
(|J) and neglecting the terms of 0(g 2 ), we find that the 
resulting linear equation admits an exponential solution, 
g(y) — aexp(Ay) with a > provided v is related to A 
via the dispersion relation, 



v = -r log 



dep(e)e 



-Xt 



(6) 



For generic distributions p(e), the function «a has a 
unique minimum at A = A* and by the general veloc- 
ity selection principle |13||, t his minimum velocity, v\* is 
selected by the front |ll| , |l2|] . 

Thus the cumulative distribution of the minimum en- 
ergy approaches a scaling form for large N, Pn{%) — 
R n (x) = Q 2 (x) — > q 2 [x + log N] , where the function 
q(y) is given by the solution of Eq. (||) and v\* is de- 
termined by minimizing Eq. (^). The question we are 
interested in is: what is the asymptotic form of q(y) for 
large yl We show below that that for any bounded dis- 
tribution /9(e), the function q(y) for large y indeed has 
the Gumbel shape, q{y) — » exp[— ci exp(c2y)] where ci 
and C2 are positive constants. On the other hand, for un- 
bounded distributions p(e), the Gumbel law breaks down 
and asymptotic forward tail of q(y) is nonuniversal and is 
determined explicitly by the distribution p(e) . For exam- 
ple, for the exponential distribution, p(e) = exp[— |e|]/2, 
we find exactly q{y) — ► exp(— y) for large y. For a generic 
unbounded distribution, one can prove a lower bound, 
q(y) > f(y) for lar S e 2/> where f(y) = f°° p{e)de. 




FIG. 2. The traveling front for the function Q n (x) for 
n — 10 (the solid line), 20 (the dashed line) and 30 (the 
dotted line) for exponential distribution p(e) = exp[— |e|]/2. 
In the inset, we plot the logarithm of the collapsed scaling 
function q(x + vn) (for different n). The scaling function q(y) 
evidently has an exponential tail for large y. 

We first focus on the unbounded distributions p(e). 
Let us first consider the exponential distribution, p(e) = 
exp(— |e|)/2. In this case, by first making a change of 
variable e — > y — v — e inside the integrand on the right 
hand of side of Eq. (|J) and then differentiating twice the 
resulting equation, we get 



^ = q{y) 



q 2 (y-v). 



(7) 



For large y, clearly the nonlinear term is negligible since 
q{y) is small. Using the boundary condition q{y) — > as 
!/^oowe then get, q{y) — > Aexp(—y) for large y where 
A is a constant. Thus we get an exponential forward 
tail instead of the standard super-exponential forward 
tail of the Gumbel distribution. Note that the velocity 
v is determined, as before, from the y — > — oo tail where 
q(y) = l-ae Xy and Eq. (§) gives, v x = log[2/(l - A 2 )]/A 
with < A < 1, in accordance with the general for- 
mula in Eq. (^|). The function v\ has a unique min- 
imum at A* = 0.603582 . . . and the chosen front ve- 
locity is then, u A . = 1.89899.... In Fig. (2), we 
show that Q n (x) indeed approaches the scaling form, 
Qn(x) — > q(x + v\*n) and the tail of the scaling func- 
tion is given by, q(y) ~ exp(— y) (see the inset of Fig. 
(2)) as predicted analytically. 

For a generic unbounded distribution it is difficult to 
derive exact results. However, one can easily derive a 
lower bound for q(y). From Eq. (^|), it is clear that 
q(y) > fy°l v dep(e)q 2 (y — v — e). This follows since the 
integrand on the right hand hand side of Eq. (||) is al- 
ways positive. Since the function q{y) saturates to 1 very 
quickly for negative y, we can replace q 2 (y — v — e) by 1 on 
the right hand side of the above lower bound. This gives, 
for large y, q(y) > f(y) where f(y) = p(e)de. For ex- 
ample, for the Gaussian distribution, p(e) = e~ e / 2 /V2n, 
this result indicates that q(y) should decay at most as 
fast as f(y) = erfc(y/\/2)- Thus, for generic unbounded 
distributions, the forward tail of the function q(y) for 
large y is highly nonuniversal and is generally different 
from the super-exponential forward tail as in the Gumbel 
distribution. 

Next we consider the bounded distributions p(e). The 
lower bound discussed in the previous paragraph contin- 
ues to hold for bounded distributions as well, though for 
large y it trivially becomes zero for distributions with 
an upper cutoff. To obtain more precisely the behavior 
of q{y) as y — > oo, we first consider a specific example, 
p(e) = aS{e+l)+aS(e-l) + (l-2a)S(e) with < a < 1/2. 
The Eq. (||) then becomes, 
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q(y) = aq 2 {y- 



l) + aq 2 (y-v + l) 



+ (l-2a)q 2 (y-v), 



(8) 



where the velocity v = v\* is obtained by minimizing Eq. 
@ with respect to A. In this particular case, we get from 
Eq. (§), 

v x = - log [4a cosh(A) + 2(1 - 2a)] , (9) 
A 

which has a unique minimum at A = A* (a) for all 
< a < 1/2. We then need to analyze the large y be- 
havior of q(y) in Eq. (|J) with v — v\* . Note that as one 
increases y from — oo, q(y) remains approximately 1 up 
to the back edge of the front at y — and then starts de- 
creasing to as y increases beyond 0. The idea would be 
to determine q(y) for a fixed large y by iterating Eq. (||) 
backwards in y till we reach the back edge of the front at 
y = where q(y) rs 1. Anticipating a super-exponential 
decay of q(y) for large y, one can neglect the second and 
the third term on the right hand side of Eq. (||) and iter- 
ate the equation retaining only the first term. Iterating 
m times backward we get, 



q(y)-a 2m ' 1 [q (y - m(v + l))f 



(10) 



How many iterations do we need to reach starting from 
a fixed large yl Clearly the required value of m is given 
by, m = y/(v + 1) so that the argument of the function 
on the right hand side of Eq. ([H]) becomes 0. Using 
g(0) « 1, we get from Eq. ( |l0| ) the large y behavior, 

2 w/(«+i) 



(ii) 



confirming the super-exponential forward tail of q(y) and 
also justifying, a posteriori, the neglect of the second 
and the third term in the iteration of Eq. ( p[). We 
have verified the analytical prediction in Eq. (|11|) by 
direct numerical integration of Eq. (Q) with p(e) = 
aS(e + 1) + ad(e - 1) + (1 - 2a)5{e) for a = 1/4 (see 
Fig. §))■ 
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FIG. 3. The traveling front for the function Q n {x) 
for n = 100 (the solid line), 150 (the dashed 
line) and 200 (the dotted line) for the distribution 
p(e) = aS(e + 1) + aS(e - 1) + (1 - 2a) 5(e) with a = 1/4. 
In the inset, we plot log(— log(a(y))) against y which clearly 
shows the super-exponential decay of the scaling function q(y) 
for large y. 

The argument leading to the result in Eq. ( |rT| ) above 
uses only the fact that distribution p(e) has an upper cut- 
off at e = 1. Thus we expect that q(y) will always have a 
super-exponential forward tail as long as the distribution 
p{e) is bounded with an upper cutoff A. Let us consider 
another example of a bounded distribution, namely the 
uniform distribution, p(e) = [6(e + 1) — 8(e — l)]/2. In 
this case, we get from Eq. (||), 



1 



y — v-\-l 



dzq 2 (z). 



(12) 



V—v—l 



Differentiating the Eq. (O) with respect to y yields, 



dq _ 1 
dy~ ~ 2 



q 2 (y-v + l)--q 2 (y-v-l). (13) 



Again we anticipate that q{y) will have a super- 
exponential tail for large y. If so, one can make the 
approximation, log(—dq/dy) w log(g(y)). Using this in 
Eq. (|l^) we iterate the equation backwards as before af- 
ter dropping the first term on the right hand side of Eq. 
(|l3|). Using the same line of arguments used in the pre- 
vious paragraph, we finally get a super-exponential tail 
for large y as before, 



2 »/(« + D 



(14) 



Note that the velocity v in Eq. (jlj) has to be determined 
by minimizing Eq. (0) with a uniform distribution. Thus, 
in general, for any bounded distribution, we expect that 
for large y 



q(y) sa exp 



(15) 



where the constants c and v depend explicitly on the dis- 
tribution p(e). 

Having established the forward tail behavior of 
Rn{x) — q 2 (x + vn), we now turn to the persistence. 
The persistence is simply given by Pjv(0) = R n (0) — 
Ql(0) = q 2 (vn), where N — 2". Thus for large N or 
equivalently for large n, the asymptotic behavior of per- 
sistence Pjv(0) = q 2 (vn) is governed by the forward tail 
of the function q(y) for large y. Let us first consider the 
bounded distributions. In this case, using the result from 
Eq. (jl^) for q(y), we get the following exact result for 
persistence for large N = 2 n , 
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Pn(0) = Ql(0) = q 2 (vn) « exp [-2cN a ] 



(16) 



where a = v/(v + I) and v is determined by minimiz- 
ing Eq. (^). Thus the persistence has an anomalous 
stretched exponential decay for large N instead of the 
standard exponential decay. We have verified this ana- 
lytical prediction by numerically integrating Eq. (|]) for 
different bounded distributions. In Fig. (4), we show the 
result for the distribution p(e) = a5(e + 1) + ad(e — 1) + 
(1 — 2a)5(e). In this case, a — v/(v + 1) where v is the 
minimum value of the dispersion relation in Eq. (||) and 
is clearly a continuous function of the parameter a. In the 
inset of Fig. (||), we compare the analytical prediction 
for the exponent a(a) = v/(l + v) with that obtained 
from the numerical integration for various values of a. 
The agreement is evidently very good. 
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FIG. 4. The function log(- log(Q„(0))), obtained from 
the numerical integration of Eq. (k|), is plotted against n for 
the distribution p(e) = a5(e + l) + aS(e — 1) + (1— 2a) 5(e) with 
a = 1/4. The linear increase with n confirms the stretched 
exponential decay of Pjv(0) = Qn(0) f° r large N — 2 n . In the 
inset is shown the value of a(a) calculated analytically (solid 
line) with that measured numerically by direct integration of 
Eq. (g) (circles). 

We next consider the unbounded distributions such as 
p(e) = exp[— |e|]/2. For this exponential distribution, us- 
ing the asymptotic behavior q(y) — Aexp(— y), we find 
that for large N, 

Pjv(0) = Ql(Q) = q 2 {vn) ~ exp[-2H ~ N~ , (17) 

where (3 — 2u/log2 with v, as usual, determined via 
minimizing Eq. (Q). Thus in this case, persistence 
again decays anomalously but now as a power law with 
a nonuniversal exponent (3. Again we verified this ana- 
lytical prediction numerically by directly integrating Eq. 
([|) with the exponential distribution (see Fig. (|5|)). For 
generic unbounded distributions, using the lower bound 



q(y) > f(y) for large y where f(y) = f™ p(e)de, we 
get -P/v(0) = q 2 (nv) > f 2 [vlogN/ log 2], again highly 
anomalous. 
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FIG. 5. The function log(Q n (0))), obtained from the nu- 
merical integration of Eq. m), is plotted against n for the 
distribution p(e) = exp( — |e|)/2. The linear decrease with n 
confirms the power law decay of Pjv(0) = Q^(0) for large 
N = 2" with the exponent f3 = 2u/log2 where v m 1.89. 

In summary, we have investigated in detail the dis- 
tribution of the minimum energy of a directed polymer 
on a Cayley tree. We have shown that the hierarchi- 
cal correlations between the energies of different paths 
have a considerable effect on the distribution of mini- 
mum energy depending on the distribution of bond ener- 
gies p(e). In the case of bounded distributions p(e) of the 
bond energies, we have shown that the forward tail has 
a super-exponential tail as in the Gumbel distribution. 
However, for unbounded distributions p(e) the forward 
tail is highly non universal and depends explicitly on the 
distribution p(e). This rich behavior of the forward tail 
of the minimum energy distribution is shown to lead to a 
variety of anomalous behavior for the persistence proba- 
bility -Pzv(O), ranging from a stretched exponential decay 
for bounded distributions p(e) to power law decay when 
p(e) is exponential. 
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